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Background of the Invention 

5 This application claims priority to U. S. S. N. 60/193,888 filed March 31, 

2000. The United States government has certain rights in this invention by 
virtue of grants to Eugenia Wang from the National Institute on Aging (AG09278) 
and from the Defense Advance Research Project Agency (DARPA) of the 
Department of Defense of the United States of America. 

10 With the advent of the Human Genome Project, one is confronted with 

voluminous information demonstrating that biological systems may be 
controlled by hundreds of genes working in concert. A single glance at the ever- 
increasing number of genes involved in signal transduction makes one wonder 
just how many genes are needed to choreograph the symphonic dance of 

15 implementing a signal, from the receptor-ligand binding to the nuclear response 
of transcriptional activation. During the 1980's and early 1990's, biologists were 
busy dissecting single genes' functions from the reductionist point of view. This 
approach, while thorough in its exact methodological analysis of genetic impact, 
lacks the expanded vision of how each particular single gene functions in the 

20 context of many sister genes or partners, to accomplish a biological task. Thus, 
it is not surprising that the technology of high-throughput gene screening is 
emerging rapidly, in the attempt to identify tens or hundreds of genes whose 
changes, viewed in composite genetic signatures, define a particular 
physiological state. This gene signature approach, complemented by single gene 

25 analysis, provides a vertical, in-depth analysis of an individual gene's function, 
as well as the comprehensive picture of the pattern of gene expression in which 
the particular gene functions. The notion of genetic signature can be further 
generalized to address the question of inter-individual variance, by comparing 
individuals from cohorts of hundreds or thousands. 

30 The unfathomable task of comparing several dozens of single nucleotide 

polymorphisms (SnP) in a hundred people can now be approached easily by 
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DNA biochip technology (Wang, et al. Science 280:1077-1082 (1998)). For 
example, a p53 DNA chip is used popularly for the identification and gene 
screening of unique cancer risks, to discover new SnPs as well as screening 
known SnPs. Either task needs a fast, multiplex approach requiring data entry 
5 on the scale of hundreds and thousands, a demand that can only be met by high- 
throughput technology. The presently available microarray biochip technology 
is certainly the method of choice to solve the problem of complexity, and the 
previously impossible task of defining a genetic signature for a unique person in 
a cohort with accuracy and speed that are impossible by the conventional 

10 diagnostic approach. Therefore, from bench-side researchers to bedside 

physicians, there is intense interest in the technology of microarray analysis, for 
screening or identifying tens or hundreds of genes related to disease or normal 
states of a given person or biological system. 

cDNA and oligonucleotide microarrays are becoming an increasingly 

1 5 powerful technique for investigating gene expression patterns. In spite of the fast 
progress in this field, some limitations of the technique persist. One of the major 
obstacles is the requirement for a large amount of mRNA. Another problem with 
existing microarray systems is data mining; while information on expression of 
tens of thousands genes is absolutely vital to estimate the functions of new genes, 

20 in some instances a researcher is interested in the expression profile of only a 
subset of genes, in many physiological conditions. 

It is an object of the present invention to provide a method and materials 
for the rapid analysis of genetic information based on a common regulatory 
feature. 

25 It is a further object of the present invention to provide a method and 

materials for sensitive and quick analysis of genetic information present in very 
small quantities. 

Summary of the Invention 

Microarray technology is a fast-growing field of biomedical research, 
30 aiming to investigate changes in molecular features of hundreds of genes. The 
multiple parallel processing of information generated from matrices of huge 
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numbers of loci on a solid substrate has allowed the gathering of gene signatures 
defining specific biological states. A new approach has been developed to 
facilitate this process wherein genes of the same regulatory modality are selected. 
The transcriptional regulation of these genes is related to the same control element, 
5 the E-box, defined by the sequence CACGTG. PCR products of selected regions 
of all known genes either binding to this sequence or whose expression is 
dependent on this binding, as well as genes interacting with E-box-binding genes 
and control genes, are arrayed on a nylon membrane or other appropriate 
microchip susbstrate, which is then used as an E-box-specific microarray. The 

1 0 transcriptionally regulated profile of E-box-related genes specific to a given 

cultured cell sample is then determined by unique labeled cDNAs probes produced 
from RNAs isolated from the culture of interest. 

The production of E-box microarrays provides an approach to custom- 
adapt the gene screening task to analyze a subgroup of gene expressions controlled 

15 by the same molecular modality. E-box binding-related genes represent a specific 
group of basic helix-loop-helix/leucine zipper transcription factors, recognizing the 
core-binding site CACGTG. They play important roles in regulation of basic 
cellular functions, like proliferation and apoptosis (c-Myc) or tissue-specific 
differentiation (Myod). As demonstrated by the example, careful selection of 

20 genes for the microarray allowed extraction of E-box gene specific signatures of 
HeLa cells and normal human lymphocytes. The significant differences in 
expression of 3-6 genes out of 61 are already much more manageable than can be 
detected from ordinary microarrays with massive numbers of genes, in the 
hundreds or thousands. 

25 Brief Description of the Drawings 

Figure 1 depicts cDNA microarray hybridization for evaluation of E-box 
binding-related gene expression. The matrix position, with each gene's 
abbreviation, is written underneath each locus of three repeats of dots with 
identical amounts deposited; the X-coordinates denote the number 1, 2, 3, 4, and 5 

30 positions, and Y-coordinates denote the "a" through "o" positions. The matrix 
location for each gene triplet is then identified as X,Y coordinates. For example, 
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5k denotes the position of N-Myc, and 3d denotes of the position of Mad. The 
same coordinates are also included in Table 2. 

Figures 2 A and 2B shows the expression profiles of E-box binding-related 
gene expressions in Hela cells. Figure 2A - total RNA was labeled with 
5 digoxigenin in RT reaction with gene specific primers; Figure 2B - mRNA was 
labeled with digoxigenin in RT reaction with oligo(dT) primers. Arrows within the 
matrix show positions of: 7- Hela DNA (positive control); 77- lambda DNA 
(negative control); 777- UBC; 7F-RPL-13A; F-MBP-1; J7-HPRT1. The 
distance between dots can be measured by the bar of 1mm. 

10 Figure 3 depicts hybridization of products of multiplex PCR with 5 pair of 

primers with cDNA microarray. Arrows within the matrix point to: 7-Mrdb; 77- 
c-Myc p64; 777- TFII-1; IV- ODC1 ; V- cdc25A; VI- Hela genomic DNA. 

Figure 4 shows the relationship between concentrations of 5 genes 
including Mrdb ? c-Myc ? TFII-1, ODC1, and cdc25a ; and intensity of hybridization 

1 5 signals. Logarithmic approximation is shown. Dot intensity is represented by the 
arbitrary units on the Y-axis; concentration is measured as ng/ml on the X-axis. 

Figures 5 A and 5B show the expression profiles of E-box-related genes in 
Hela cells (Figure 5 A), and normal human lymphocytes (Figure 5). Arrows within 
the matrix show positions of: 7- Aldolase C; 77- Mad4; 777- MBP-1 . 

20 Figures 6 A. 6B and 6C are pairwise comparisons of E-box gene expression 

in Hela cells and human lymphocytes. Two independent hybridizations are 
averaged for each type of cell Figure 6A - Three-dimensional, and Figure 6B - 
two-dimensional, representations of differences in gene expression. Each panel 
corresponds to one column in Figure 5 A, and each bar represents an individual 

25 gene. Figure 6C - Distribution of genes with common gain (red) or loss (blue) of 
expression in dependence on relative ratio value. The relative fold ratio between 
samples SI and S2 is computed as 





max(SijS2) 



30 
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which yields a value in the range of [-1 ,+1 ]. Positive values correspond to up- 
regulation, and negative values correspond to down-regulation, of genes in sample 
S2. The relative fold ratio has a similar meaning to that of conventional fold ratio, 
except that the value is normalized and symmetric, with clear physical 
5 interpretation. Rzw(Si ? S2) = ±0.5 corresponds to a two-fold up- or down-regulation 
in normalizing the two samples; a set of housekeeping genes of relatively constant 
expression levels were selected as controls, and linear normalization was applied. 

Detailed Description of the Invention 
10 E-box Regulatory Genes 

The production of E-box microarrays provides an approach to custom- 
adapt the gene screening task to analyze a subgroup of gene expressions controlled 
by the same molecular modality. E-box binding-related genes represent a specific 
group of basic helix-loop-helix/leucine zipper transcription factors, recognizing the 

1 5 core-binding site C ACGTG. As used herein, E-box genes refer to all genes having 
the E-box in their promoter region, as well as E-box binding and interacting genes. 
They play important roles in regulation of basic cellular functions, like 
proliferation and apoptosis (c-Myc) or tissue-specific differentiation (Myod). As 
demonstrated by the example, careful selection of genes for the microarray allowed 

20 extraction of E-box gene specific signatures of HeLa cells and normal human 

lymphocytes. The significant differences in expression of 3-6 genes out of 61 are 
already much more manageable than can be detected from ordinary microarrays 
with massive numbers of genes, in the hundreds or thousands. For example, in 
SAGA analysis of 45,000 genes, it was found that about only 1% are differentially 

25 expressed in normal and cancerous human cells (Zhang, et aL, Science 276, 1268- 
1272 (1997)). A similar estimation resulted from analysis of expression profiles in 
young and old mice; expressions of only 1.8% of about 6,000 genes are changed 
more than 2-fold (Lee, et al., Science 285, 1390-1393 (1999)). 

The best-known representative of E-box -binding genes is c-Myc, whose 

30 transactivating activity plays crucial roles in the regulation of cell cycle, 
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proliferation and apoptosis (Eilers, Mol. Cells 9, 1-6 (1999); Dang, C.V. Mol. Cell 
Biol 19, 1-11 (1999); Facchini andPenn, FASEB J. 12, 633-651 (1998)). For this 
reason, genes interacting with or regulating expression for c-Myc, as well as some 
target genes whose expression is E-box-binding-dependent, are included in this 
5 microarray. Representative E-box genes are shown in Table 2. 
Housekeeping Genes 

Housekeeping genes are used to normalize results of expression. These are 
genes that are selected based on the relatively invariable levels of expression in the 
system which is being examined, for example, the state such as age or a particular 

10 disease. Representative housekeeping genes are shown in Table 2. These include 
tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, 
hypoxanthine phosphoribosyltransferase I (Lesh-Nyhan syndrome), Major 
histocompatibility complex, class I, C, Ubiquitin C, Glyceraldehyde-3- 
phosphate dehydrogenase, Human mRNA fragment encoding cytoplasmic actin, 

15 60S Ribosomal protein LI 3 A, and Aldolase C. 
Probes 

In the preferred embodiment, a set of primers for use in detecting changes 
in expression of genes include the E-box regulatory sequence, are between 480 
and 700 base pairs length, have a melting point between 75 and 85°C, and 
20 include non-consensus sequence with protein coding sequence, so that there is 
no detectable hybridization between homologous genes, more preferably where 
there is no hybridization between homologous genes. Examples of homologous 
genes include c-myc and c-myc associated genes. 
Diseases and States 

25 The changes in expression of the E-box regulatory genes described herein 

can be used to assess changes associated with a particular state or disease. The 
association of certain E-box genes such as c-myc with cancer and 
neurodegeneration, and its role in apoptosis, are well established. Other genes 
include yyl, myc-Ll, and myc-L2, which affect cells, cell components, and 

30 specific molecules, for example, cardiomysin, myotube, osteoblasts, and 
osteoclasts. Changes in expression of individual genes, either by turning 
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expression on or off, or altering the amount of expression, can be used to assess 
changes in states such as age or diseases associated with cancer of tissues such as 
breast, prostate, and colon, immunological changes such as inflammation, 
neurodegenerative diseases, cardiovascular disorders, and musculoskeletal 
5 disorders, including disorders and diseases of bones such as osteoarthritis and 
osteoporosis, and muscle degeneration. 
Screening 

The arrays can be tested by screening with labeled probes to determine if 
there is expression of a particular gene in the array and how much, to thereby 

10 construct a "fingerprint" of the disease or disorder at that time, using genes present 
in cells or tissues obtained from one or more individuals having the disease or 
disorder or characterized by a particular state, such as age. The effect of a 
compound or composition on the disorder or disease or state can also be assessed 
by comparing the fingerprint obtained with control cells or tissues, and cells or 

1 5 tissues treated with the compound or obtained from an animal treated with the 

compound (or compounds, or dosage regime, or exposed to particular conditions). 
This is especially useful for initial screening of the effect of potential drugs, either 
to determine potential efficacy and/or toxicity. Those compounds which appear 
promising can then be further screened to determine if they can reduce or reverse 

20 the severity of the disease or disorder. Compounds to be screened can be proteins 
or peptides, sugars or polysaccharides, nucleic acid molecules, or synthetic 
molecules. 

Microchip Array Technology and Analysis 

Information resources 

25 There are several DNA microchip technology reviews in the literature 

(Bowtell, D.D.L. Nature Genetics Supplement 21:25-32 (1999); Constantine, 
and Harrington Life Science News 1:11-13 (1998); Ramsay, G Nature 
Biotechnology 16:40-44 (1998)), and several good web sites detailing the 
apparatus and protocols used by other laboratories, nothing in the literature 

30 offers a description of a working arrangement to serve as a user-friendly guide. 
Table 1 lists several good web sites for highly active laboratories in DNA 



[ 344647v I 



7 



WANG 100 
15132/2 



microchip technology, as well as several sources of robotics systems and 
equipment, imaging software and systems and vendors of robotic components. 
The microarrayer 

A turnkey microarrayer can be purchased, with an enclosure for 
5 temperature, humidity and air quality control; a system such as the 
GeneMachines™ OmniGrid (San Carlos, CA) would be sufficient. 
Alternatively, to save on the cost of a robotic system, a microarrayer can be built 
in the laboratory. The Brown Laboratory web site, for example, gives full 
details for component specifications, mechanical drawings for machined parts, a 

10 list of vendors, an assembly guide, and free microarrayer software. 
Operation of the tips, XYZ motion contro l, and computer program 

The robotic gantry of a typical printing tip microarryer is composed of 3 
individual assemblies of linear robotic tables, and motors driven by 3 
corresponding amplifiers which are coupled to a motion controller in the driving 

15 computer. All of this forms the appropriate 3-axis motion control system (i.e.: 
X, Y and Z axes) for microarraying. The three perpendicular axes allow for 
sampling, printing and washing with the components of the microarryer system. 
Printing substrate and samples 

In terms of a printing substrate for producing the microchips, poly~L- 

20 lysine-coated glass slides seem to work best to immobilize the printed DNA. 
Nylon hybridization membranes can also be used as the printing substrate, and 
allow for a much easier immobilization protocol, as well as better visualization 
if a colorimetric method is used for hybridization detection. 

To contain the samples, conical 96-well microplates work well by 

25 localizing small volumes of sample in the wells. When printing many different 
samples, 384-well microplates are best due to their higher capacity and low 
storage volume and the smaller sample sizes (< 10 jal) can be used readily. 
During storage, sample plates should be covered with an adhesive-backed 
plastic seal, to prevent sample loss by evaporation. 
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Sample Preparation 

Samples prepared for printing are loaded into 384-well microplates, 10 
|il aliquots per well These samples can be used for up to 8 to 10 printing runs, 
with proper storage. In printing arrays with the Arraylt™ printing tips on the 
5 GeneMachines™ Omni Grid microarrayer, it is possible to print several thousand 
spots onto one chip either in one array or duplicate arrays on one chip. The 
printing tip delivery volume is approximately 1 nl per spot with a spot diameter 
of approximately 100 |im. Therefore, depending upon the surface area of the 
substrate being used as the chip and the number of tips used for printing, several 

10 large arrays are possible with close spacing (less than 100 um) for up to 100 
chips per run. For typical experiments in this laboratory, arrays are printed in 
duplicate 20x20 arrays per chip with a spot spacing of 250 jam using between 20 
to 30 chips per run. 

To extend the lifetime of the samples, after printing, the microtiter plates 

15 are sealed with adhesive-backed plastic covers in addition to the microplate lids. 
Furthermore, before using the stored samples again, the microplates are 
centrifuged to gather any condensate in the wells, and to localize the sample 
fluids at the bottom of each well. 
Array Analyzer/Imaging system 

20 Depending upon the selected approach to hybridization analysis of the 

printed microarrays, a system fitted onto an existing microscope, a microarray 
scanner or confocal laser scanner may be purchased, or a confocal laser scanner 
may be built. 

The system used to compile the digital microarray images is built around 
25 an Olympus BH-2 upright light microscope, fitted with a Sony color CCD 
camera, an Applied Scientific Instrumentation (Eugene, OR) X-Y scanning 
stage, and a fiber optic ring illuminator from Edmund Scientific Co. (Barrington, 
NJ). EMPIX Imaging, Inc. (Mississauga, ON) assembled the system for 
compiling microarray images, containing a 24 bit frame grabber; it is installed in 
30 a 450 MHz P3 PC equipped with 5 1 2 Mb RAM and a 1 9" SVGA monitor, 
where the image acquisition and system control are governed under the 
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Windows 98 operating system by Northern Eclipse imaging software. A 
3COM™ 10/100 Base TX network card installed in the computer links the 
imaging computer to a small LAN (Lynksys, Irvine, CA), containing a color 
laser printer and two other computers used for image analysis and data storage. 
5 The size of the arrays and individual spots dictates the use of low power 

objectives (either 2.5X or 4X) and the X-Y scanning stage to capture the image 
of the entire array. 

Many of our microarray experiments are done using nylon membranes 
(Hybond-N) as the printing substrate. Probes are labeled with DIG-dUTP in a 

10 reverse transcription reaction; target/probe hybridization is detected with anti- 
DIG-coupled alkaline phosphatase, and a subsequent reaction of the alkaline 
phosphatase with an NBT/BCIP stain/substrate. This method requires the ring 
illuminator to distinguish artifacts from array spots on the stained hybridization 
membranes. Otherwise, if poly-L-lysine coated glass slides are used as the 

15 microarray printing substrate, illumination of the microarray specimen is carried 
out normally. 
Image quantitation 

When the microarray digital imaging routine is completed, the compiled 
montage can be transferred by way of the network to the computer stations 

20 devoted to image analysis and data storage. The microarray images are created 
as TIFF files; before quantitation can begin, the raw digital images are filtered to 
bear only the microarray signal data, aligned in Adobe PhotoShop™ software, 
and then transferred to the GeneAnalyzer microarray analysis software. 
GeneAnalyzer removes the background, and the reduced digital microarray 

25 images are passed through an image location routine to optimally localize the 
spots of the microarray image. When the GeneAnalyzer software has "grabbed" 
the individual spots of the reduced digital microarray image, the program can 
proceed to quantitate the density of the individual spots. Each spot on the 
microarray is then regarded as an individual signal, and its intensity serves as the 

30 foundation of the data needed to reflect the hybridization reaction. After 
comparison with appropriate positive and negative controls for nonspecific 
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reactions, true signal value is subtracted from noise to produce the desired 
information on each hybridization reaction. 

The microarray spot density data are transferred into an analysis routine 
in the mathematical analysis software, MATLAB, for graphical representation 
5 of all data; the density values, as well as the respective calculated values, of all 
digitized microarray data are tabulated in a Microsoft Excel™ spreadsheet. A 
full record of the progression of images, tabulated data and all graphical 
representations can immediately be printed to complete the microarray 
experiment analysis. 

10 Labels for Probes and Detection 

Microarrays typically contain at separate sites nanomolar (less than 
picogram) quantities of individual genes, cDNAs, or ESTs on a substrate such as 
a nitrocellulose or silicon plate, or photolithographically prepared glass 
substrate. The arrays are hybridized to cDNA probes using standard techniques 

15 with gene-specific primer mixes. The nucleic acid to be analyzed — the target 
— is isolated, amplified and labeled, typically with a fluorescent reporter group, 
radiolabel or phosphorous label probe. After the hybridization reaction is 
completed, the array is inserted into the scanner, where patterns of hybridization 
are detected. The hybridization data are collected as light emitted from the 

20 reporter groups already incorporated into the target, which is now bound to the 
probe array. Probes that perfectly match the target generally produce stronger 
signals than those that have mismatches. Since the sequence and position of 
each probe on the array are known, by complementarity, the identity of the 
target nucleic acid applied to the probe array can be determined. 

There are a variety of labels that are used. cDNAs and ESTs can be 
detected by autoradiography or phosphorimaging ( 32 P). Fluorescent dyes are 
also used, and are commercially available from suppliers such as Clontech. 

In the preferred embodiment the label is digoxigenin (DIG). This specific 
enzymatic labeling probe allows the end result of detecting hybridization reaction 
intensity by colorimetric evaluation of alkaline phosphatase-coupled antibody to 
DIG. The enzymatic deposit on each locus of the E-box microarray can be readily 
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analyzed by an upright microscope attached to a CCD camera, without the problem 
of the long delay needed for exposure time with radioactive probes, or the 
photobleaching and high background reaction problem associated with the 
fluorescent probe approach. 
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Table 1: Informative web sites for DNA microarray technology 

DNA microarray technology web sites URL 

Automation and Miniaturization in Genome Analysis, 
Max Plank Institute for Molecular Genetics 

http://www.mpimg-berlin-dahlem.mpg.de/-autom/autom.htm 
Department of Molecular Biotechnology, 
University of Washington 

http://chroma.mbt.washington.edu/mod_www/ 
Functional Genomics Group, 
Albert Einstein College of Medicine 

http://sequence.aecom.yu.edu/bioinf/funcgenomic.html 
Genomics Group, 
Children's Hospital of Philadelphia 

http://w95vcl.neuro.chop.edu/vcheunng 
Laboratory of Cancer Genetics, 
National Human Genome Research Institute 

http://www.nhgri.nih.gov/Intramural_research/Lab_cancer/ 
Joint Genome Institute, 
Lawrence Livermore National Laboratory 

http://llnl.gov/automation-robotics/poster. 1 .html 
Pat Brown Laboratory, 
Stanford University 

http://cmgm.stanford.edu/pbrown 
Stanford DNA sequence and Technology Center 
Stanford University 

http://-sequence.stanford.edu/group/techdev/ 
Microarrayers, imaging systems and scanners 
Applied Scientific Instrumentation, Inc. 

h ttp : //www . A S limagin g . com/ 
Axon Instruments, Inc. 

http://axon.com/GN_Genomics.html 
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Beecher Instruments 

http .7/www.beecherinstrumentsxom/ 
BioDiscovery, Inc. 

http://www.biodiscovery.com/ 
BioRobotics, Ltd. 

http://www.biorobotics.com/ 
Empix Imaging, Inc. 

http ://www. empix . com/ 
GeneMachines, Genomic Instrumentation Services, Inc. 

http://www.genemachines.com/ 
General Microarray Information 

http://www.microan-ay.org/ 
General Scanning, Inc. 

http://www.genscan.com/ 
Genetic MicroSystems, Inc. 

http ://www.geneticmicro.com/ 
Genometrix, Inc. 

http://www.genometrix.com/ 
Genomic Solutions 

http://www.genomicsolutions.com/ 
Imaging Research, Inc. 

http://www.imagingresearch.com/ 
Intelligent Automation 

http : //www . i as . com 
Molecular Dynamics, Inc. 

http://www.mdyn.com/arrays/arraywhat.htm 
Radius Biosciences 

http://www.ultranet.com/-radius 
Research Genetics 

http://www.resgen.com 
ScanAlyze software 
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http://bronzino.stanford.edu/ScanAlyze/ 
Telechem International, Inc. 

http ://www. wenetMelechem/ 
Western Technology Martketing 

http://www.westerntechnology.com/ 

Robotics 

Galil 

http://galilmc.com/ 
Parker-Compumotor 

http ://www.compumotor.com/ 

Parker-Daedal 

http://www.daedalpositioning.com/ 

Example 1: Digoxigenin Enzymatic Detection for Microarray Analysis of 
E-Box Binding Related Gene Expression. 

Realizing the advantages and problems of cDNA microarrays for 
5 expression profiling, in this study a new approach was developed based on 
utilizing digoxigenin (DIG) to label target cDNA produced from gene-specific 
primers, with subsequent incubation with anti-digoxigenin antibody conjugated 
with alkaline phosphatase (AP), and colorimetric or chemi luminescent detection. A 
set of genes containing the E-box binding element (CACGTG), located in 
10 promoter regions of many genes, was selected as the probes. Probably the best- 
known representative of E-box -binding genes is c-Myc, whose transactivating 
activity plays crucial roles in the regulation of cell cycle, proliferation and 
apoptosis (Eilers, M. Mol. Cells 9, 1-6 (1999); Dang, C.V. Mol. Cell Biol. 19, 1-1 1 
(1999); Facchini and Penn FASEB Journal 12,633-651 (1998)). Genes interacting 
1 5 with or regulating expression for c-Myc, as well as some target genes whose 
expression is E-box-binding-dependent, are included in this microarray. These 
custom-designed microarrays, combined with the enzymatic approach to label 
hybridization probes, allow the development of an inexpensive, user- friendly 
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system for high-throughput gene screening assay of specific subgroups of gene 
expressions. 

Materials and Methods 

Selection of pro bes for arraying 
5 E-box-binding proteins, as well as c-Myc-regulating, -interacting and target 

genes, were chosen from different data bases - GeneAtlas 
(http://www.citi2.fr/GENATLAS), GeneCards 
(http://bioinfo.weizmann.ac.il/cards), GenBank 
(http://www.ncbi.nkm.nih.gov/Web/Genbank) and PubMed 

10 (http://www.ncbi.nlm.nih.gov/PubMed). Unigene 

(http://www.ncbi.nlm.nih.gov/UniGene/index.html) cluster numbers and sequences 
were used to identify genes and verify their uniqueness. Nine housekeeping genes, 
as well as HeLa cell DNA were selected as positive controls; as negative controls, 
lambda DNA and 2xSSC (2x standard salt solution - 0.3 M NaCl, 30 mM Na 

1 5 citrate, pH 7.0) were chosen. For each gene, a pair of primers was generated with 
the help of Primer3 software (Rosen and Skaletsky (1998) Primer3. Code available 
a t http://www-genome.wi.mit.edu/ genome_software/other/primer3.htmL ). The 
program parameters were chosen in such a way that the melting temperature of the 
amplicon should be close to 80°C but not more than 88°C or less than 75°C, the 

20 length of the amplicon was to be generally around 450 bp (with a few outlyers 
between 300 and 700 bp), with primer annealing temperature about 60°C, and 
average length of primers 23 bp. Sequences of all amplicons have been carefully 
verified using proprietary software (BLASTN, FASTA), to avoid homology with 
repetitive elements and other related sequences, and also to distinguish between 

25 genes from the same family. A full list of all selected genes is represented in Table 
1. 

DNA, RNA and mRNA isolation 

o 

Total RNA and DNA were isolated from approximately 10 HeLa cell 
cultures and human peripheral lymphocytes isolated from fresh blood aliquots 
30 using Trizol reagent (Gibco BRL, Burlington, ON). DNA and RNA 

concentrations and quality were determined by spectrophotometric and gel 
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electrophoresis analysis in 0.8 or 2% agarose gels, respectively. Poly(A) + RNA 
was isolated from 150 |ng of total RNA using the Oligotex mRNA kit (Qiagen, 
Mississauga, ON), according to the manufacturer's instructions. 
Am plification and purification of probes 
5 1 0 jig of total RNA was reverse-transcribed in 40 jal reaction, using 200 U 

of MMLV (Gibco BRL, Burlington, ON) according to the manufacturer's 
instructions. Two PCR reactions for each pair of primers were conducted in a total 
volume of 100 |il, in a GeneAmp PCR system 9700 (PE Applied Biosystems, 
Norwalk, CT). Each 50 jjl reaction (10 mM Tris-HCl, pH8.6, 50 mM KC1, 0.1% 

1 0 Triton X-100, 1.5 mM MgCl 2 , 0.5 mM of each dNTP, 20 pM of each primer, 1.25 
U of Taq DNA polymerase (Amersham Pharmacia Biotech, Baie d'Urfe, QC) and 
10 \x\ of RT reaction or 100 ng of genomic DNA) was thermal-cycled as follows: 
first cycle at 94°C for 5 min, 35 cycles at 94°C for 45 sec, at 60°C for 1 min and at 
72°C for 30 sec, the last cycle at 72°C for 7 min. Probes that could not be 

1 5 amplified in RT-PCR were extracted from genomic DNA, with the condition that 
the primers were selected in the 3' region of a gene. Size and yield of PCR 
products were determined by gel electrophoresis in 2% agarose. Then PCR 
products were purified from solution or agarose gel bands, following preparative 
agarose gel electrophoresis (if by-products were determined), using GFX columns 

20 (Amersham Pharmacia Biotech, Baie d'Urfe, QC). After purification, 

concentrations of all probes were estimated by agarose gel electrophoresis, and 
adjusted to approximately 100 ng/jil. 
Robotic arraying 

Purified PCR products in 2x standard salt solution (SSC) were arrayed in 
25 triplicates from 384-well plates, utilizing a GeneMachines™ Omni Grid 

microarrayer (Genomic Instrumentation Services, San Carlos, CA) equipped with 
ChipMaker2 tips (Telechem International, San Jose, CA). The spacing between 
dots was 400 jam. The positions of genes in this array are indicated in Table 4. 
Microarrays were printed on Hybond-N or Hybond-N+ nylon membranes 
30 (Amersham Pharmacia Biotech, Baie d'Urfe, QC), attached to standard glass slides 
with tape. Before and after each 1 0 slides with membranes, regular slides were 

17 
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inserted to inspect printing quality. After arraying, membranes were UV irradiated 
at 50 mJ (GS Gene linker, Bio-Rad, Hercules, CA) to immobilize the DNA; then 
fragments of membranes containing arrays (approximately 1 x 1.5 cm) were cut 
off, denaturated in boiling water for 5 min, rinsed in 0.1% SDS for 5 min, and used 
5 for prehybridization. After the UV irradiation step, membranes can be stored 
attached to glass slides. 

Preparation of DIG-labeled cDNA for hybridization 
An initial mix of gene-specific primers (GSP) was produced. For this 
purpose, 1 nM of each primer that was used in RT-PCR reactions to prepare probes 

1 0 was mixed in a total volume of 250 |ul. Digoxigenin (DIG)-labeled targets were 
produced in RT reaction as follows: 1 jil of GSP, 4 \ig of total RNA, and RNAse- 
free water in total volume of 14 \xl were heated at 65°C for 1 5 min to denature the 
RNA, and then kept at room temperature for 5 min for primer annealing. 
Alternatively, 2 jug of mRNA and 400 ng of oligo(dT)i 2 -i8 primers were used. The 

1 5 reaction mix, containing 8 jil of 5x first strand buffer supplied by the enzyme's 
manufacturer, 2 of 10 mM mix of dATP, dCTP and dGTP (final concentration 
500 iiM each), 4 \xL of 0.1 M DDT, 0.7 yl RNAguard, 31 U/jil (Amersham 
Pharmacia Biotech, Baie d'Urfe, QC), 10 ^ of a 2 mM mix of 19:1 dTTP:DIG-l 1- 
dUTP (Roche, Laval, QC) and 2 pi (200 U/pl) of Moloney murine leukemia virus 

20 reverse transcriptase (MMLV RT) (Gibco BRL, Burlington, ON), was added. 

Reaction was carried out at 37°C for 1 h, followed by enzyme degradation at 94°C 
for 5 min in GeneAmp 9700. Alternatively, Omniscript reverse transcriptase 
(Qiagen, Mississauga, ON) was used according to the manufacturer's instructions. 
Labeling reactions were purified on GFX columns; this step eliminates all labeled 

25 products shorter than 100 bp, as well as unincorporated nucleotides, primers and 
protein. 

After purification, efficacy of labeling was estimated as follows: 1 \xl of 
1 : 1 00, 1 : 1 000, 1 : 1 0000 and 1 : 1 00000 dilutions were spotted on Hybond-N 
membrane, together with dilutions of control DIG-labeled DNA at known 
30 concentrations (10-0.01 pg/jj.1) as standardization for our assays (Roche, Laval, 
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QC); after immobilization with UV, the membrane was incubated with alkaline 
phosphatase (AP)-conjugated antibody to DIG (Anti-DIG-AP), rinsed, and stained 
with chemiluminescent substrate, Disodium 3-(4-methoxyspiro{l,2-dioxetane- 
3,2'-(5 , -chloro)tricyclo[33.Ll 3,7 ]decan}-4-yl) phenyl phosphate - CSPD (Roche, 
5 Laval, QC), according to the manufacturer's instructions. 
Hybridization and processing 

For hybridization and pre-hybridization, DIG Easy Hyb buffer (Roche, 
Laval, QC), or formamide buffer containing 50% deionized formamide, 5x SSC, 
2% blocking solution (Roche, Laval, QC), 0.1% N-lauroylsarcosine, 0.02% SDS, 

10 100 |J,g/ml denaturated salmon DNA, were used- Membranes were pre-hybridized 
at 42°Cfor2h in a hybridization oven (Autoblot, Bellco, Vineland, NJ). 
Hybridization was performed at 42°C overnight in 1 ml or less of hybridization 
solution, in 5-ml Falcon tubes. The concentration of labeled probes in the 
hybridization mix constituted 10 ng/ml. Before hybridization the probes were 

1 5 denaturated at 65°C for 10 min in hybridization solution. 

Afterwards, hybridization membranes were rinsed (unless mentioned 
specially) twice with lxSSC, 0.1% SDS for 15 min at room temperature, and then 
withprewarmed O.lxSSC, 0.1% SDS for 15 min at 68°C. Alternatively, 
membranes were rinsed in more stringent conditions, i.e. twice in 2xSSC, 0.1% 

20 SDS at 68°C. for 30 min, and twice in O.lxSSC, 0.1% SDS at 68°C for 30 min. 
After equilibration for 5 min in rinsing buffer (0.3% Tween 20 in maleic buffer 
(0.1 M maleic acid, 0.15 M NaCl, pH 7.5)), membranes were blocked for 1.5 h in 
1% blocking solution under slight agitation, and then treated for 30 min in 10 ml of 
alkaline phosphatase-conjugated sheep anti-digoxigenin antibody (Roche, Laval, 

25 QC), diluted 1 : 1000 for colorimetric staining, or 1 : 10000 for chemiluminescent 
detection. Following antibody incubation, membranes were rinsed three times for 
15 min in rinsing buffer, equilibrated for 2 min in detection buffer (0.1 M Tris- 
HC1, 0.15 M NaCl, pH 9.5), and stained with 175 ^ig/ml 5-Bromo-4-chloro-3- 
indolyl-phosphate, toluidine salt (BCIP), and 330 (ig/ml Nitro blue tetrazolium 

3 0 chloride (NBT) in detection buffer. Alternatively, 1 : 1 00 dilution of CSPD was 
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applied, and chemiluminescence was detected according to the manufacturer's 
recommendations (Roche, Laval, QC) using BioMax MR Kodak film. 
Scanning and evaluation of arrays 

Arrays were scanned on an Olympus microscope equipped with a 
5 Multiscan-4 System (Applied Scientific Instrumentation, Eugene, OR) and a color 
CCD Sony 950 camera. Data acquisition and montage of different fields of view 
into one file were accomplished with the help of the Northern Eclipse Imaging 
System (EMPIX Imaging, Missisauga, ON). Quantitative measurements of 
intensity of enzymatic reaction at each dot, background subtraction, normalization 
1 0 to housekeeping genes, and comparison of paired hybridizations were all 
performed with an in-house software program. 
Results 

Selection of probes and primers 

After careful evaluation of different data bases, 61 genes were selected for 

15 arraying, including 9 housekeeping genes. This set of genes contains 38 E-box 
binding genes, together with the Myc (c-, N-, LI and L2) family, 5 c-Myc 
regulating factors (ZFP161, nm23-H2S, MBP-1, RBMS 1 and RBMS2), 5 c-Myc 
interacting genes (YY1, TFII-1, PAM, MM-1 and alpha-tubulin), and 4 c-Myc 
target genes (prothymosin alpha, MRDB, ODC1, and cdc25A). Positive controls 

20 include 9 housekeeping genes with different levels of expression (UBC, beta-actin, 
GADPH, HPRT1, phospholipase 2, HLA-C, PRS9, aldolase C, and RPL13A), and 
also HeLa genomic DNA. Lambda DNA and 2xSSC (2x standard salt solution), 
which was used as solvent for all probes, were selected as negative controls. 
Primers for all genes were selected with the help of Primer3 software, 

25 provided that they corresponded to the same conditions for PCR reaction, and 

produced products of similar melting temperature. Most products were produced 
from HeLa or lymphocyte cDNA. In case PCR amplification failed from cDNA, 
primers were selected in 3' region of these genes, and amplicons were produced 
from HeLa genomic DNA. The average annealing temperature of primers was 

30 60. 1±0.9°C, which allowed all PCR reactions to be in the 96-well format. Sizes 
and melting temperatures of products, and annealing temperatures of primers, are 

20 
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represented in Table 4. The average size of PCR products for arraying, and their 
melting temperature, were 441±58 bp and 80±3°C, respectively. Selecting these 
parameters allowed hybridization and post-hybridization rinsing in stringent 
conditions, decreasing drastically the possibility of cross-hybridization and 
5 background level. 

Scrupulous selection of primers could be used to distinguish in some cases 
between very close members of gene families (for example, USF1 and 2, ID2, 3 
and 4, members of the Myc family, and so on), or between two different transcripts 
of c-Myc. As is well known, there are several different transcription forms of c- 

1 0 Myc, transcribed from different promoters, with varying regulation properties 
(Bodescot and Brison Gene 174, 1 15-120 (1996)). Selecting primers in the 1 st 
exon and the 2 nd -3 ,d exons allowed discrimination between full-size and truncated 
forms of c-Myc. 

Conditions influencing hybridization 

1 5 Several parameters which probably influence the results of hybridization 

with cDNA microarrays printed on nylon membranes were carefully tested. First 
of all, gene profiling results were examined using either mRNA or total HeLa 
RNA, Surprisingly, the whole pattern of expression was very similar, with the 
exception of a few genes (UBC, RPL-13A, MBP-1) the signals from mRNA were 

20 several times higher; the most prominent difference was found in UBC, where it 
approached 5-fold. Alternatively, signals for HPRT1 and phospholipase A2 were 
higher with total RNA. In conditions where quantity of mRNA is a limiting factor, 
total RNA can be used instead, without significant differences in results of 
expression profiling, 

25 Comparison of two reverse transcription enzymes, Moloney murine 

leukemia virus (MMLV) (Gibco BRL, Burlington, ON) and OmniScript (Qiagen, 
Mississauga, ON), used for production of digoxigenin-labeled targets for 
hybridization, did not reveal any difference in expression profile when gene- 
specific primers were used; but signal intensity was stronger after labeling with 

30 MMLV, especially after a day of staining (Table 5). When oligo(dT) primers were 
used with mRNA, some significant differences in expression levels of several 
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genes were detected. Labeling with OmniScript produced 2-3 times more intense 
signals for RP-S9, RP-L13A, enolasel, N-Myc and MAD4. 

To decide which buffer is better for hybridization with microarrays, we 
compared EasyHyb (Roche, Laval, QC) and formamide-based buffers. The 
5 expression profile of HeLa mRNA was found to be independent of buffer 
composition, but signals were higher after hybridization in formamide buffer 
(Table 2), and addition of 2% blocking reagent further reduced background in 
comparison with EasyHyb, thereby facilitating subsequent scanning and image 
evaluation. 

10 No substantial differences were found in expression profile of HeLa 

mRNA when rinsing conditions of different stringency were used (see Materials 
and Methods). More stringent rinsing evenly lowered all signals, and produced 
signals with sharper borders, rendering them easier to scan and evaluate. Standard 
rinsing conditions are probably already stringent enough in hybridizations with 

15 cDNA microarrays and gene-specific primers; therefore standard rinsing is 
preferred, because it is not so time-consuming. 

Comparison of positively charged (Hybond-N+) with neutral (Hybond-N) 
nylon membranes revealed no differences in sensitivity. Aside from this 
consideration, the neutral (Hybond-N) nylon membrane is preferable due to its 

20 stronger texture for printing support. This strength was not found in the positively 
charged Hybond-N+ membrane, which was found to retain visible printing 
footprints, causing complications in image analysis and increased background. 

As maybe seen from Table 5, increasing the staining time from overnight 
to 1 day usually increased the overall strength of signals by only 10%. Longer 

25 staining time increased the background level of the reaction, which compromised 
the possible advantage of higher sensitivity. Variations in hybridization conditions 
can increase overall signal intensity by 30-40%. However, the positive effects are 
not additive, and the maximum difference in total intensity of microarrays 
approaches only 50%. The following conditions for hybridization of DIG-labeled 

30 targets with the cDNA microarray are optimal: printing probes on neutral nylon 
membrane, reverse transcription reaction with total RNA, gene-specific primers 
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and MMLV reverse transcriptase, hybridization in formamide buffer, and standard 
rinsing conditions. These conditions were implemented in the experiments 
described in the following paragraphs. 

Specificity, sensitivity and reproducibility of hybridization 
5 To evaluate the specificity of cDNA microarray hybridization, 5 genes 

(MRDB, ODC, TFII-1, cdc25A and c-Myc), covering the entire range of length 
(368-71 1 bp) of arrayed products, were labeled in multiplex PCR reaction and 
hybridized with cDNA arrays. As expected, only 5 samples on the array were 
positive, as well as the HeLa genomic DNA as control since it will hybridize with 

1 0 the locus where HeLa genomic DNA was spotted at the highest concentration at 
the position 51, and negative show little or no detection at the positions la and lb 
where spotted HeLa genomic DNA is of low quantity, In all, these experiments 
demonstrate no signs of cross-hybridization (Figure 3). 

To estimate the sensitivity and derive a calibration curve for cDNA 

1 5 microarray hybridization, different concentrations of this 5-gene PCR mix (10, 4, 
1, 0.4, 0.1 and 0,04 ng/ml) were hybridized with arrays. The results of this 
experiment are presented in Figure 4. Linear dependence in semi-logarithmic 
coordinates, with an obvious plateau in the region of 4-10 ng/ml, was observed for 
all genes, with the same slope of 45±2. The lower limit of detection varies slightly 

20 for different probes in the array, and corresponds to 40- 1 00 pg/ml per individual 
gene. These results are close to the detection limit of the digoxigenin system (10- 
30 pg/ml), according to the manufacturer (Roche, Laval, QC). This level of 
sensitivity allows detection of mRNAs of intermediate abundance, each 
representing more than 0.04% of total cell mRNA. Taking into account this 

25 detection level, it is estimated that for hybridization with a microarray containing 
about 70 genes of intermediate abundance, 7 ng of labeled probe produced from 
gene-specific primers should suffice. For the next hybridizations, a concentration 
of labeled probes of 10 ng/ml was selected. The yield of standard reverse 
transcription labeling reaction with gene specific primers is about 20-40 ng; 

30 therefore, one labeling reaction yields enough product for 2-4 independent 

hybridization reactions. In contrast to unstable radioactive probes, DIG-labeled 
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probes can be stored and reused several times. Reusing hybridization mixes 2-3 
times, after storing at -20° C for several months, gave results quite concordant with 
the original ones. 

The arrays were scanned at a resolution of 3600 dpi, and results were 
5 compared with results of microscope scanning. In general, variability between 
replicated dots was higher in the case of the scanner, and linearity may be 
influenced by the scanner's software. The scanner can be used for initial 
evaluation of hybridization results, especially when chemilumenescence detection 
is implemented. 

10 Expression profiling of Hela cells in comparison with human lymphocytes 

Expression profiles of E-box genes were determined in replicating HeLa 
cells and normal human lymphocytes. In lymphocytes, the most prominent 
alteration consisted of more than 2-fold up-regulation of E-box-related genes 
TCF4, MAD4 and Aldolase C. Alternatively, down-regulation of c-Myc- 

1 5 regulating genes MBP1 and Nm23-H2S, and small down-regulation of c-Myc and 
up-regulation of N-Myc, were registered in lymphocytes in comparison with HeLa 
cells. Expression of some c-Myc interacting and target genes was down- (MM-1, 
ODC1) or up-regulated (PAM, MrDb) in lymphocytes. Also, small up- (MITF, 
ID2) and down- (TFEB) regulation was detected in expression of several E-box- 

20 binding genes in lymphocytes, in comparison with HeLa cells. 
Summary 

cDNA and oligonucleotide microarrays are becoming an increasingly 
powerful technique for investigating gene expression patterns. In spite of the fast 
progress in this field, some limitations of the technique persist. One of the major 

25 obstacles is the requirement for a large amount of mRNA. Another problem with 
existing microarray systems is data mining; while information on expression of 
tens of thousands genes is absolutely vital to estimate the functions of new genes, 
in some instances a researcher is interested in the expression profile of only a 
subset of genes, in many physiological conditions. The significant differences in 

30 expression of 3-6 genes out of 61 are already much more manageable than can be 
detected from ordinary microarrays with massive numbers of genes, in the 
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hundreds or thousands. For example, in SAGA analysis of 45,000 genes, it was 
found that about only 1% are differentially expressed in normal and cancerous 
human cells. A similar estimation resulted from analysis of expression profiles in 
young and old mice; expressions of only 1.8% of about 6,000 genes are changed 
more than 2-fold. 

Printing microarrays on nylon filters, and using digoxigenin to label the 
cDNA with gene-specific primers, permits use of as little as 4 jug of total RNA per 
hybridization. This is the same sensitivity that can be attained with radioactivity in 
the Clontech protocol, and it is much more sensitive than ordinary microarrays, 
which need several \ig of mRNA. In addition, DIG-labeled probes of high labeling 
sensitivity can be stored for a long time, and reused several times, in contrast to 
fluorescently or radioactively labeled ones. 

Proprietary selection of genes for inclusion in a microarray, and using 
digoxigenin for labeling, also helps avoid another disadvantage of radioactive 
labeling: genes in the E-box microarray are all in the same category of abundance 
(intermediate or low abundant). Excluding highly abundant genes eliminates the 
problem of merging of strong signals. Merged signals in some circumstances 
substantially complicate the process of scanning, and create unreliable results 
during the data acquisition step. 
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